8-5 DTW for Speaker Identification

One of the typical applications of DTW is text-independent speaker identification. The application is divided into two stages:

At the registration stage, each speaker is required to pronounce several utterances as the spoken passwords.
At the application stage, the speaker pronounces one of the spoken keywords and the system is required to find the identity of the speaker by comparing the spoken keywords against those keywords received at the registration stage. The comparisons are usually achieved by DTW for its robustness to variance in speech rate.

For instance, within the "dataSet" directory of the ML toolbox, we have a collection of recordings consisting of two sessions of 3 subjects each. Each subject was requrested to pronounce 10 spoken passwords for 3 times at each session. So each folder within a session contains 30 recordings for each subject.
First of all, it is a good programming habit to put all parameters related to our task into a function, which return a structure variable containing all parameters:
Example 1: speakerIdTextDependent/sidPrmSet.m

Note that the above function also puts required toolboxes into the MATLAB search path.
To read the data from the folder, see the next example:
Example 2: speakerIdTextDependent/goFeaExtract.m

To check data consistency, see the next example:
Example 3: speakerIdTextDependent/goDataCheck.m

To evaluate the performance using DTW, try the next example:
Example 4: speakerIdTextDependent/goPerfEval.m

After obtaining the overall recognition rate, we can compute statistics of each person, and also list the misclassified utterances with their false output, as shown in the following example:
Example 5: speakerIdTextDependent/goPostAnalysis.m

This is a very important step toward error analysis for further improve the classification system.
Data Clustering and Pattern Recognition (資料分群與樣式辨認)